Measuring forecast skill: is it real skill or is it the varying climatology?

نویسندگان

  • THOMAS M. HAMILL
  • JOSIP JURAS
چکیده

It is common practice to summarize the skill of weather forecasts from an accumulation of samples spanning many locations and dates. In calculating many of these scores, there is an implicit assumption that the climatological frequency of event occurrence is approximately invariant over all samples. If the event frequency actually varies among the samples, the metrics may report a skill that is different from that expected. Many common deterministic verification metrics, such as threat scores, are prone to mis-reporting skill, and probabilistic forecast metrics such as the Brier skill score and relative operating characteristic skill score can also be affected. Three examples are provided that demonstrate unexpected skill, two from synthetic data and one with actual forecast data. In the first example, positive skill was reported in a situation where metrics were calculated from a composite of forecasts that were comprised of random draws from the climatology of two distinct locations. As the difference in climatological event frequency between the two locations was increased, the reported skill also increased. A second example demonstrates that when the climatological event frequency varies among samples, the metrics may excessively weight samples with the greatest observational uncertainty. A final example demonstrates unexpectedly large skill in the equitable threat score of deterministic precipitation forecasts. Guidelines are suggested for how to adjust skill computations to minimize these effects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A synoptic-climatology approach to increase the skill of numerical weather predictions over Iran

Simplifications used in regional climate models decrease the accuracy of the regional climate models. To overcome this deficiency, usually a statistical technique of MOS is used to improve the skill of gridded outputs of the Numerical Weather Prediction (NWP) models. In this paper, an experimental synoptic-climatology based method has been used to calibrate, and decrease amount of errors in GFS...

متن کامل

Seasonal predictions of ice extent in the Arctic Ocean

[1] How well can the extent of arctic sea ice be predicted for lead periods of up to one year? The forecast ability of a linear empirical model is explored. It uses as predictors historical information about the ocean and ice obtained from an ice–ocean model retrospective analysis. The monthly model fields are represented by a correlation-weighted average based on the predicted ice extent. The ...

متن کامل

Evaluation of boundary-layer type in a weather forecast model utilising long-term Doppler lidar observations

Many studies evaluating model boundary-layer schemes focus either on near-surface parameters or on short-term observational campaigns. This reflects the observational datasets that are widely available for use in model evaluation. In this paper we show how surface and long-term Doppler lidar observations, combined in a way to match model representation of the boundary layer as closely as possib...

متن کامل

Categorical Climate Forecasts through Regularization and Optimal Combination of Multiple GCM Ensembles

A Bayesian methodology is used to assess the information content of categorical, probabilistic forecasts of specific variables derived from a general circulation model (GCM) forecast ensemble, and to combine a ‘‘prior’’ forecast (climatological probabilities of each category) with a categorical probabilistic forecast derived from a GCM ensemble to develop posterior, or ‘‘regularized’’ categoric...

متن کامل

Sub-seasonal tropical cyclone genesis prediction and MJO in the S2S dataset

Subseasonal probabilistic prediction of tropical cyclone (TC) genesis is investigated here using models from the seasonal to subseasonal (S2S) dataset. Forecasts are produced for basin-wide TC occurrence at weekly temporal resolution. Forecast skill is measured using the Brier skill score relative to two no-skill climatological forecasts: an annual mean climatology that is constant through the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007